A Lexicon Driven Method for Unconstrained Bangla Handwritten Word Recognition
Abstract
In this paper, a lexicon-driven segmentation-recognition scheme for unconstrained Bangla handwritten word recognition is proposed for Indian postal automation. In the proposed method, the input document is first binarized and the slant of each individual word is corrected. Next, using the water reservoir concept, words are pre-segmented into possible primitive components (characters or their parts). To merge these primitive components into characters and find the optimum character segmentation, dynamic programming (DP) is applied with the total likelihood of the characters as the objective function. The likelihood of a character is computed with a modified quadratic discriminant function (MQDF). The features used in the MQDF are mainly based on the directional features of the contour points of the components. We tested our system on Bangla city-name images, and at present the proposed system achieves an overall accuracy of 87.21%.
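The merging step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (`match_word`, `recognize`), the `max_merge` limit, and the toy scorer are all hypothetical, and the `score` callback merely stands in for an MQDF-based character log-likelihood. It shows one common way to set up a lexicon-driven DP in which consecutive primitive components are grouped into the characters of a candidate word so that the summed log-likelihood is maximal.

```python
import math
from typing import Callable, List, Sequence, Tuple

# score(start, end, ch): log-likelihood that primitives [start, end) form
# the character ch. Hypothetical stand-in for an MQDF-based classifier.
Scorer = Callable[[int, int, str], float]


def match_word(n_primitives: int, word: str, score: Scorer,
               max_merge: int = 3) -> Tuple[float, List[Tuple[int, int]]]:
    """Best total log-likelihood and character spans for one lexicon word."""
    neg = -math.inf
    k_max = len(word)
    # best[k][t]: best score with the first k characters covering primitives [0, t)
    best = [[neg] * (n_primitives + 1) for _ in range(k_max + 1)]
    back = [[-1] * (n_primitives + 1) for _ in range(k_max + 1)]
    best[0][0] = 0.0
    for k in range(1, k_max + 1):
        for t in range(k, n_primitives + 1):
            # Character k spans the last m primitives before position t.
            for m in range(1, max_merge + 1):
                s = t - m
                if s < 0 or best[k - 1][s] == neg:
                    continue
                cand = best[k - 1][s] + score(s, t, word[k - 1])
                if cand > best[k][t]:
                    best[k][t] = cand
                    back[k][t] = s
    if best[k_max][n_primitives] == neg:
        return neg, []  # the word cannot cover all primitives
    # Trace the chosen cut points back from the end of the word.
    spans: List[Tuple[int, int]] = []
    t = n_primitives
    for k in range(k_max, 0, -1):
        s = back[k][t]
        spans.append((s, t))
        t = s
    spans.reverse()
    return best[k_max][n_primitives], spans


def recognize(n_primitives: int, lexicon: Sequence[str], score: Scorer,
              max_merge: int = 3) -> str:
    """Return the lexicon word with the highest total log-likelihood."""
    return max(lexicon,
               key=lambda w: match_word(n_primitives, w, score, max_merge)[0])


# Toy scorer (assumption): five primitives whose true grouping spells "abc".
TRUE_SPANS = {(0, 2, "a"), (2, 3, "b"), (3, 5, "c")}


def toy_score(start: int, end: int, ch: str) -> float:
    return -0.1 if (start, end, ch) in TRUE_SPANS else -5.0


if __name__ == "__main__":
    print(match_word(5, "abc", toy_score))
    print(recognize(5, ["abd", "abc", "ab"], toy_score))
```

Comparing raw totals across words of different lengths, as `recognize` does here, is a simplification; the abstract does not say whether or how the scores are normalized.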
Similar resources
A Lexicon Driven Approach for Off-line Recognition of Unconstrained Handwritten Korean Words
We propose a new method for the recognition of unconstrained handwritten words consisting of Korean and numeric characters. To overcome the difficulty in separating touching characters, we adopt an oversegmentation technique and we find the optimal segment combination using a lexicon-driven word scoring technique and a nearest neighbor classifier. The optimal combination gives the final segment...
A lexicon-driven approach for optimal segment combination in off-line recognition of unconstrained handwritten Korean words
We propose a new method for off-line recognition of unconstrained handwritten words consisting of Korean and numeric characters. To overcome the difficulty in separating touching characters, we adopt an over-segmentation strategy. Given a slice of the input word image, we find the optimal segment combination using a lexicon-driven word scoring technique and a nearest-neighbor classifier. The optimal...
A two-stage method for recognizing handwritten Farsi words using adaptive blocking of the image gradient
This paper presents a two-step method for offline handwritten Farsi word recognition. In the first step, to improve recognition accuracy and speed, an algorithm is proposed for the initial elimination of lexicon entries unlikely to match the input image. For lexicon reduction, the words of the lexicon are clustered using the ISOCLUS and hierarchical clustering algorithms. Clustering is based on the featur...
Efficient Word Segmentation Driven by Unconstrained Handwritten Phrase Recognition
An efficient system which finds the best match between an input image and a lexicon is presented. To capture writing style of spacing between words and characters, prime stroke analysis based on statistical methods is introduced. A method for estimating bound on number of characters without actual recognition is also presented. For system efficiency, before actual recognition, classified groups of word...
An improved offline handwritten character segmentation algorithm for Bangla script
Effective segmentation of offline handwritten word images of unconstrained handwritten Bangla script is a challenging problem in Optical Character Recognition (OCR) applications. The presence of a continuous horizontal line called ‘Matra’ is an important feature of this script. However, in unconstrained cursive handwriting, Matra can be wavy or discontinuous, which makes the problem of segmentation diffic...
Publication date: 2006